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Abstract — This letter introduces a novel channel coding design 
frameworli for short-length codewords that permits balancing the 
tradeoff between the bit error rate floor and waterfall region by 
modifying a single real-valued parameter. The proposed approach 
is based on combining convolutional coding with a g-ary linear 
combination and unequal energy allocation, the latter being 
controlled by the aforementioned parameter. EXIT charts are 
used to shed light on the convergence characteristics of the 
associated iterative decoder, which is described in terms of factor 
graphs. Simulation results show that the proposed scheme is able 
to adjust its end-to-end error rate performance efficiently and 
easily, on the contrary to previous approaches that require a full 
code redesign when the error rate requirements of the application 
change. Simulations also show that, at mid-range bit-error rates, 
there is a small performance penalty with respect to the previous 
approaches. However, the EXIT chart analysis and the simulation 
results suggest that for very low bit-error rates the proposed 
system will exhibit lower error floors than previous approaches. 



I. Introduction 

Iteratively decodable (i.e. Turbo-like) channel codes such 
as Low Density Parity -Check (LDPC) |1| or Turbo codes 1*21, 
have been widely shown to perform near capacity in point- 
to-point communications when used with codeword lengths 
beyond 10^. However, such codes can be impractical in many 
applications scenarios demanding low latencies (e.g. real-time 
video delivery), mainly due to their associated decoding com- 
plexity and limited technological resources of the underlying 
hardware. This rationale motivates the upsurge of research 
on short-length codes (i.e. codes with codewords of several 
hundreds to few thousands coded symbols) to the above 
scenarios. 

However, when dealing with short-length codewords the 
capacity-approaching performance of LDPC or Turbo codes 
may severely degrade due to the high error floors obtained 
under this condition ||3), Q. Several contributions have fo- 
cused on reducing these error floors such as |5|, where a novel 
design technique is proposed to produce short-length parity- 
check matrices for LDPC codes that lead to low error floors. 
Indeed, as outlined in |5| decreasing the error floor of LDPC 
codes is essential for potential applications such as data storage 
and deep-space communications, which require a Bit Error 
Rate (BER) as low as 10^^^. However, there are some other 
applications where higher error floors can be allowed (e.g. 
a Quasi-Error-Free system in DVB-T broadcasting requires a 
maximum BER of 10^''). In these applications the code design 
paradigm is not to reduce the error floor to its minimum, but 
to achieve the imposed BER limit at a signal to noise ratio 



(SNR) as low as possible. In this context, the well-known 
design tradeoff between the error floor and the BER waterfall 
region existing in Turbo-like codes (6\ becomes relevant. 
Unfortunately, balancing this tradeoff mainly depends on the 
design of the code itself (e.g. the parity-check matrix in LDPC 
codes), thus any change in the BER limit involves redesigning 
the code structure in its entirety. 

In this paper we propose an alternative channel coding 
approach for short-length codewords that can be easily adapted 
to any BER needs of a given application without requiring a 
full redesign of the code. That is, it can easily switch between 
high error floors and low-SNR waterfall regions (as those 
imposed by e.g. DVB-T), or minimum error floors and higher- 
SNR waterfall regions (e.g. deep space communications). The 
proposed code hinges on adding extra parity bits, generated by 
linearly combining the encoded bits over a non-binary Galois 
field, to the already convolutionally-encoded bits. Moreover, 
unequal energy allocation is used between the convolutionally- 
encoded symbols and the linearly-combined symbols, and 
EXIT charts are used to analyze the performance of the 
associated decoder and to optimize the coefficients of the 
linear combination. Simulation results will show that the afore- 
mentioned tradeoff can be easily balanced by only varying 
a single parameter, namely, the unequal energy allocation 
parameter A, hence the switch between regions can be done 
without changing the encoder and decoder 

The manuscript is organized as follows: Section [HI presents 
the system model, the proposed encoder and the corresponding 
decoder, whereas an analysis of the code based on EXIT charts 



is performed in Section III Next, Section IV discusses the 



obtained Monte Carlo simulation results and ends the paper 
by drawing some concluding remarks. 

II. Proposed Code 

In order to ease the understanding of the proposed code 
some definitions are first introduced. Consider the set of 
ah 2'^ polynomials p{z) of degree q — 1 with coefficients 
lying in GF{2) (the binary Galois field). Let g{z) be a 
prime polynomial (i.e., monic and irreducible polynomial) 
of order q. Then, this set becomes a finite field, GF(2'), 
by defining the addition ® and multiplication (g) rules as 
the mod g{z) remainder of the sum and product of two 
polynomials, respectively. Notice that, since the mod g{z) 
addition rule is just a componentwise addition of coefficients 
in GF{2), GF{2'') under addition is isomorphic to the vector 
space {GF{2)Y of binary g-tuples with mod 2 elementwise 



addition denoted hereafter as A. Therefore, there is a one- 
to-one mapping Vg : {GF{'2.)Y -J> GF(29) defined as 
ipq{aa, ..., ag_i) = Y.l=l "'kz'' such that tpq{a) ® ipq{h) = 
V'g(aAb), where a, b € {GF{2)Y. In addition, we index the 
elements pi e GF(29), i e {0, . . . , 2? - 1} by the base-10 
notation of the corresponding binary tuple (oq, . . . , flg-i)- In 
the following we refer as non-binary symbols to the elements 
of the finite field GF(29). 



El" = \Ef, with A > and define A = 10 log A (dB). Thus, 
we have 

^s - 2+A-C.s, E^ - 2+\^'^- (2) 




Fig. 1. Encoder associated to the proposed code. 

We consider a point-to-point scenario consisting of a binary 
unit-entropy information source S, which generates blocks 
U e {0,1}^^. As depicted in Figure [11 the sequence U 
is divided into two sequences U^ and U^ of length K, 
which are channel-coded by a terminated convolutional code, 
producing the codewords C" = {Crlili G {0,1}^, with 
TO € {li2}. Then, each codeword is interleaved yielding 
the interleaved codeword X™ — 11^ (C™), where Hi and 
112 are two different spread interleavers with a spread factor 
equal to q. Next, each of the interleaved coded sequences 
X^ and X^ is split into q-length sub-sequences {Xj^};^'^'' = 
{X/^, . . . , Xl^}fJ^^ and {X^j^l/^, respectively. We denote as 
V™ = V'glX™) G GF{2'^) the non-binary symbol associated 
to the corresponding sub-sequence: the non-binary symbol 
VJ^ is computed as the linear combination of the non-binary 
symbols VJ^ and V^^, i.e. 

where h = {h^,h?), h"^ G {pi}^^^^ represents the coef- 
ficients used in the linear combination. Each sub-sequence 
X^ is computed from the associated non-binary symbols V^^ 
as X^ = ipq'^iVp). In the following, we refer as Linear 
Combination (LC) code to the rate-2/3 code formed by the 
sub-codewords (X^-'^, X^, X^). Thus, there are N/q parallel LC 
codes. Finally, the sequence X"^ is given by X"^ = {Xf }j^j , 
and the final codeword X = {Xt}f;^i is formed by X = 
(X\ X^, X^). Thus, the overall code rate is given by 2K/3N. 
We apply unequal energy allocation between the sub- 
codewords associated to the terminated convolutional codes 
(X^ and X^) and the sub-codeword associated to the linear 
combination (X^). We denote a^Sf'^ = E[(A:™)2] the energy 
of the sub-codewords X^ and X^, whereas E'f = E[{X^)'^] 
and Es = E[Af^] denote the energy of X'^ and X, respec- 
tively. Furthermore, without loss of generality, we assume 

'E[-] stands for expected value. 



This results in the modulated symbols St = 2Xt — 1, 
with t ~ 1, . . . , 3iV, and average energy per symbol Eg- The 
received symbol per real dimension at the receiver is given by 
Yt = St + Nt, where {Nt}f;^i are modelled as real Gaussian 
i.i.d. random variables with zero mean and variance No/2. 



A. Proposed Decoder 

The destination receives the channel outputs Y = 
(Yi, Y2, Y^), where the subsequence Y^ (with j G {1, 2, 3}) 
represents the channel outputs associated to encoded symbols 
X-' . The aim of the decoder is to estimate the source binary 
symbols U = ({f/^}^li, {t/^}^i) so that the conditional 
probability P(u™|y) (m G {1,2}) - which is obtained by 
marginalizing the joint conditional probability P(u™|y) - is 
maximized. This marginalization is efficiently computed by 
applying the Sum-Product Algorithm (SPA, see |7|) to the 
factor graph describing P(u™|y), which is shown in Figure [2] 
Observe that such an overall factor graph is composed by three 
sub-factor graphs: two describing the convolutional codes, and 
a third one describing the LC code. Since this factor graph has 
loops, the SPA is iteratively run between the sub-factor graphs 
corresponding to the LC code and the convolutional codes. 
After a fixed number of iterations I, the probabihty P(M™|y) 
based on which t/™ is computed results proportional to 

ET^ /^^^ ^.^n „m „m \^f„rri\Qf„va \ I I ^ f ^m\ 



where ~ u™ — {wjjljvfc'^^fc; ct and f3 are the messages passed 
from the adjacent state nodes to the factor node Tk given 
by the transitions in the Trellis describing the convolutional 
code; c™ - {cn'^lK^-DN/K (^ith k e {!,..., K}) rep- 
resents the coded bits associated to m™; and 7(-) are the 
likelihoods passed from the variable nodes c™ to T^. These 
likelihoods depend on the messages passed by the LC-check 
nodes associated to the interleaved binary symbol a;™ i , 
i.e. 7(cJ") = 7(x™_i^^p, where 7(2:7') oc p(y;|a;J") and 

yi — (y/iy?,yf) (i-c- those components of Y associated to 
the LC check node LCi). It can be shown that such likelihoods 
can be further factorized as 

p(yrkr)n^"K^')^"W"), (3) 

where i G {1, . . . , g}, 1 [•] is an indicator function taking value 
1 if its argument is true and otherwise, and 
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expressed by the modulated symbols {Sf ^}1^i £ {il}' ^s 



with m = 3 — m. Since the non-binary symbols can be also 
2d 

that 



V/ =^, ({(1 + 5^/2}^ J ^0,({^/,}Li), 



we have 



p{yj\vj) (X ^ 1 



,«!,■ 



J|exp 



' i,q' 



1 1 

Via ~ ^ 



i,i 



i=l 



Nn 



(5) 



In light of the above factorization, it is clear to see that 
the factor graph of the LC code is in turn composed of 
N/q parallel and identical sub-factor graphs LCi, which are 
depicted, for / = 1 and I = N/q, in Figure [2] 



AWGN CHANNEL 




Fig. 2. Factor graph of the proposed decoder. 



III. Code Analysis through EXIT Charts 

The Extrinsic Information Transfer (EXIT) Q function 
of a code is defined as the relationship between the mutual 
information at the input of the decoder (commonly denoted 
as la) and the extrinsic mutual information /g at its output, 
i.e. le — T{Ia)- For an iterative code, the chart plotting 
the transfer functions of the compounding sub-codes is called 
EXIT chart, and is known to be a powerful tool for designing 
and predicting the behavior of iterative codes (see [8] and 
reference therein). 

We denote the transfer function of the LC-code for a given 
q and h as If = Tl{If). Notice, that for Pf = the 
value of If — r^(0) will also depend on the Signal-to-Noise 
Ratio (SNR), since the channel observations y are used by 
the LC-code (last term of (|5]l and Fig. |2]). As the mutual 
information at the input of the channel decoders If is equal 
to If, the extrinsic mutual information at the output of the 
convolutional decoders is given by If — T'^^{If). Thus, for 
a successful decoding procedure, there must be an open gap 
between both EXIT curves so that the iterative decoding can 
proceed from If — to If — 1. When both transfer functions 
cross, the iterative process will stop at a given extrinsic mutual 
information of the source bits If < 1. 



Since the transfer functions are monotonically increasing 
functions, the higher the value of T^(0) is, the smaller the 
required SNR to open a gap will be. On the other hand, if 
no crossing between curves has been produced, the higher the 
value of r^(l) is, the closer If will be to 1 yielding lower 
error floors (see Fig.l3]l. Therefore, we are interested on having 
high values of both T^(l) and r^(0). 

However, due to the Area Theorem of EXIT charts fSl, 
which states that the area under the transfer function depends 
only on the rate of the encoder, high values of T^(l) yield 
to low values of T^{0) and vice versa. Therefore, for a given 
q we select a h that maximizes r^(l) • 7^(0). For the sake 
of simplicity, this is done through exhaustive search over the 
(g^ — 1)^ possible values of h. 




(a) 




(b) 

Fig. 3. EXIT Chart of different LC codes and a rate- 1/2 Convolutional Code 
for different values of q (a) and A (b). 

Next, we justify the reason why the different linear combi- 
nations have a greater impact on the value of T^(l) than on 
r^(0). When If = all the information at the LC-Code 
comes equally from the channel observations associated to 
X\ X^ and X^. On the other hand, when If = I, only the 
information provided by the channel observations associated 
to X'^ is relevant, as the information regarding X^ and X^ is 
fully supplied by the convolutional decoders. 

Furthermore, variations on the unequal energy allocation 
parameter A are expected to affect to T^(l) and T'^(O) 
similarly, given that A operates identically on X^, X^ and 

X3. 



Figure 3(a) shows that as q increases, the value of T^{1) 
increases yielding lower error floors, but obtaining BER wa- 
terfall regions at higher SNRs. However, as shown in Fig. 



3(b) the tradeoff between the SNR at which waterfall regions 



occur and its associated BER floor can be balanced through 
the selection of A. By increasing A the value of T^{1) is 
increased, leading to low error floors. However, the value of 
r^(0) is decreased, hence obtaining BER waterfall regions at 
higher SNRs. 



IV. Simulation Results 

In order to assess the performance of the proposed code, 
several Monte Carlo simulations have been run using 6- 
memory-block [554, 774]8 non-systematic rate-1/2 convolu- 
tional codea^ A zero-bit tail is appended at the source se- 
quence in order to terminate the convolutional codes properly. 
Note that this added bit tail reduces the overall spectral 
efficiency of the system, specially with very short-length 
codewords. Therefore, it must be taken into account when 
computing the gap to the Shannon Limit. The interleavers 
Hm (■) have been randomly generated and are independent 
from each other. Finally, we have considered binary PAM 
modulation with transmitted blocks of 3A^ = 2000 + q real 
symbols, and I = 20 iterations of the decoding algorithm. 
The decoding is stopped at every simulated SNR point when 
100 errors have been obtained. 

Figure |4] depicts the end-to-end Bit Error Rate (BER) of 
the system for different values of q and A. Observe that when 
A is set beyond a certain threshold, the waterfall region is 
produced at high SNRs and with no error floor detected in the 
simulations. The reason being that in this case, early-crossing 
points in the associated EXIT chart are obtained at low values 
of r^(0); according to what was stated in Section 
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further increase on the value of r^(l) does not yield any 
iterative processing gain. Consequently, high error floors are 
obtained for low values of A. Also note that due to the low 
values of r^(l) resulting when q = 2, the system utilizing 
this value of h is clearly outperformed by that using q = 3, 
q = 4 and q = 5. 

Also included in these plots are reference curves corre- 
sponding to the PEG, PEG-ACSE and PEG-ACE schemes 
analyzed in |5, Fig. 6]. These schemes are based on carefully 
designed short-length, irregular LDPC codes with low error 
floors. Despite the good performance of these schemes in 
terms of error floor versus waterfall tradeoff, it should be em- 
phasized that any change in the specifications of the underlying 
application would require replacing the entire encoder and 
decoder, which might be impractical for a variety of scenarios. 
By contrast, the coding approach here proposed allows for 
balancing this tradeoff by simply varying A at a SNR waterfall 
degradation of less than 0.5 dB (at BER = 10"^) in all cases. 
Furthermore, the tradeoff can be easily set based on the EXIT 
chart associated to the LC code. 

In addition, both the EXIT chart analysis and the simulation 
results (Fig. |4(c)| and |4(d)| l suggest that for very low BERs 
(high SNRs) the proposed system will arise lower error floors 
than the irregular LDPC code proposed in |i5J. 




Gap Id Sha 

(a) q = 


nonL 

2. 






^% 


\ 


-□-,\=.1dB 

^^,\=-3dB 
-O-BERvsgap 
-0--PEG-ACE-9 
^*-PEG|51 


5|5| 




Gap to Shannon Limit (dB) 




Gap to Stiannon Limil 
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Fig. 4. BER performance of the proposed system with different values of q. 
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The subindex 8 in the definition of the code stands for octal. 



